An Approach for Fine-Grained Profiling of Mesh-Based Parallel Programs

Authors

  • Amol S. Deshmukh
  • Qingyuan Liu
  • Karen A. Tomko
Abstract

Characterizing the dynamic behavior of parallel programs through their execution profiles helps explain that behavior and suggests optimization strategies to improve performance. Traditional event-tracing techniques write the profiled data to trace files. Applied to fine-grained profiling, the traditional approach not only yields large, unwieldy trace files but often also gives skewed results because of the inaccuracies introduced by the profiling itself. This paper describes an approach to profiling mesh-based parallel programs at a very fine level of granularity by measuring performance metrics for each mesh element. The approach is novel in that profile data is associated with mesh elements, not processors, so the data can be used, for example, to develop adaptive load-balancing policies. A tool implementing this idea has been developed; it provides an easy-to-use C API with Fortran 90 wrappers to enable fine-grained profiling of mesh-based parallel applications.
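The idea of attaching profile data to mesh elements rather than to processors can be illustrated with a minimal C sketch. This is not the tool's actual API, which the abstract does not show: every name here (MeshProfile, mesh_profile_add, mesh_profile_range_cost, now_seconds) is hypothetical, and the sketch assumes that elements are identified by a global index and that the only metric is accumulated compute time.

```c
#include <assert.h>
#include <stdlib.h>
#include <time.h>

/* Hypothetical per-element record; not the API of the tool in the paper. */
typedef struct {
    double seconds; /* accumulated compute time charged to this element */
    long   visits;  /* number of times the element was processed */
} ElemProfile;

/* Profile table indexed by (assumed) global element id, so the data
 * follows the element wherever it is assigned, not the processor. */
typedef struct {
    ElemProfile *elems;
    size_t       count;
} MeshProfile;

/* Allocate a zeroed table for n elements; returns 0 on success, -1 on failure. */
int mesh_profile_init(MeshProfile *p, size_t n) {
    p->elems = calloc(n, sizeof *p->elems);
    p->count = p->elems ? n : 0;
    return p->elems ? 0 : -1;
}

void mesh_profile_free(MeshProfile *p) {
    free(p->elems);
    p->elems = NULL;
    p->count = 0;
}

/* CPU time in seconds from the standard clock(); a real tool would
 * likely use a higher-resolution timer. */
double now_seconds(void) {
    return (double)clock() / CLOCKS_PER_SEC;
}

/* Charge a measured interval to one element. Updating an in-memory
 * accumulator avoids writing one trace record per event. */
void mesh_profile_add(MeshProfile *p, size_t elem, double seconds) {
    assert(elem < p->count);
    p->elems[elem].seconds += seconds;
    p->elems[elem].visits  += 1;
}

/* Total cost of elements in [first, last): the kind of quantity a
 * load balancer could compare across candidate partitions. */
double mesh_profile_range_cost(const MeshProfile *p, size_t first, size_t last) {
    double total = 0.0;
    assert(first <= last && last <= p->count);
    for (size_t i = first; i < last; ++i)
        total += p->elems[i].seconds;
    return total;
}
```

In an element loop, a caller would read now_seconds() before and after the per-element kernel and pass the difference to mesh_profile_add. Because the table is keyed by element, the accumulated costs stay meaningful after elements migrate between processors, which is what makes them usable as input to a repartitioning decision.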

Similar resources

Profiling Techniques for a Fine-Grained Parallel Language

Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the fine-grained parallel programming language Split-C, which provides a simple global address space m...

Fine-Grained Parallel Algorithm for Unstructured Surface Mesh Generation

Surface mesh generation is one time-consuming step in preparing an unstructured mesh model. However, its parallelisation attracts little attention. In this study, a fine-grained parallel surface meshing algorithm is proposed by taking advantage of the parallelism within the meshing process of a single face. Compared with the scheme which meshes the faces individually in parallel, the proposed a...

Integrating Domain-Data Steering with Code-Profiling Tools to Debug Data-Intensive Workflows

Computer simulations may be composed of scientific programs chained in a coherent flow and executed in High Performance Computing environments. These executions may present anomalies associated to the data that flows in parallel among programs. Several parallel code-profiling tools already support performance analysis, such as Tuning and Analysis Utilities (TAU) or provide fine-grained performa...

An improved joint model: POS tagging and dependency parsing

Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...

Evolution of texture in an ultrafine and nano grained magnesium alloy

The evolution of texture was discussed during the formation of ultra-fine and nano grains in a magnesium alloy severely deformed through accumulative back extrusion (ABE). The microstructure and texture obtained after applying multiple deformation passes at temperatures of 100 and 250°C were characterized. The results showed that after single ABE pass at 100°C an ultrafine/nano grained microstr...

Publication date: 2004